OpenAI's Voice Engine can create synthetic voices based on a 15-second clip of someone's voice. The text-to-voice generation platform is currently in limited access. Developers testing the platform are required to get explicit and informed consent from speakers, not build ways for individual users to create their own voices, and disclose to listeners that the voices are AI-generated. Example clips generated by Voice Engine are available in the article.
Monday, April 1, 2024OpenAI's Voice Engine is a model that generates speech mimicking a speaker's voice from a 15-second audio sample. It can be used in applications like educational aids, translation, and support for non-verbal individuals. OpenAI is employing a cautious approach to deployment due to potential misuse.